feat: add IngestTraces client for dedicated ingest service by pratduv · Pull Request #501 · rungalileo/galileo-python

pratduv · 2026-03-13T03:48:02Z

User description

Summary

Adds IngestTraces client class that communicates directly with the Go ingest service via httpx, bypassing the auto-generated API client
Conditionally activated when GALILEO_INGEST_URL is set; falls back to the existing Traces client otherwise
Wires the new client into all GalileoLogger ingestion paths (batch flush, streaming traces, streaming spans)

Changes

New: `IngestTraces` client (`src/galileo/traces.py`)

Standalone httpx.AsyncClient-based client targeting POST /ingest/traces/{project_id} and POST /ingest/spans/{project_id}
Resolves base URL from GALILEO_INGEST_URL env var, falling back to api_url
Handles auth headers (API key or JWT) independently
Uses @async_warn_catch_exception for resilient telemetry semantics

Modified: `GalileoLogger` (`src/galileo/logger/logger.py`)

Added _ingest_client field, created only when GALILEO_INGEST_URL is set
All ingestion call sites use self._ingest_client or self._traces_client fallback pattern
Refactored init into _ensure_project_and_log_stream, _create_traces_client, and _create_ingest_client for lazy creation

New routes (`src/galileo/constants/routes.py`)

ingest_traces = "/ingest/traces/{project_id}"
ingest_spans = "/ingest/spans/{project_id}"

Tests (`tests/test_ingest_traces_client.py`)

Constructor validation, URL resolution, auth header generation
respx-mocked HTTP tests for ingest_traces and ingest_spans
Logger integration: verifies client wiring for batch and streaming modes

Why

The Go ingest service runs at a separate URL from the main API and exposes raw HTTP endpoints (no OpenAPI client). This PR adds a thin httpx client so the SDK can route ingestion traffic directly to it when configured, without changing any behavior for users who don't set the env var.

Test plan

All new tests in test_ingest_traces_client.py pass
Pre-commit hooks (ruff, mypy) pass
Verify existing test suite still passes in CI

Fixes sc-58541

Made with Cursor

Generated description

Below is a concise technical summary of the changes proposed in this PR:
Describe how the SDK now targets the dedicated ingest service when GALILEO_INGEST_URL is configured while retaining existing APIs for batch/streaming ingestion in environments without that flag. Explain how the trace/span models (and the ADK builder) now use Logged* variants plus message/content-block helpers so multimodal inputs survive ingestion and are exercised by the logging tests.

Topic Details

Ingest routing

Describe how the logger now prefers the new IngestTraces client when the ingest endpoint env var is set, using the added routes and client wiring for both batch and streaming ingestion while falling back to the auto-generated Traces client otherwise. Include the new lazy creation helpers plus streaming ingestion overrides and the tests/test_ingest_traces_client.py coverage.

Modified files (4)

src/galileo/constants/routes.py
src/galileo/logger/logger.py
src/galileo/traces.py
tests/test_ingest_traces_client.py

Latest Contributors(2)

User	Commit	Date
calebe.sep@hotmail.com	fix-auto-convert-non-s...	March 02, 2026
mason@galileo.ai	feat-allow-session-met...	February 13, 2026

Multimodal schema

Describe how the schema and ADK layers now build on LoggedTrace/LoggedSpan/LoggedMessage plus the IngestContentBlock helpers so multimodal inputs/outputs stay typed, and how the ADK trace builder plus the batch logger tests verify that nothing is stringified when flushing.

Modified files (8)

galileo-adk/src/galileo_adk/trace_builder.py
src/galileo/schema/__init__.py
src/galileo/schema/content_blocks.py
src/galileo/schema/logged.py
src/galileo/schema/message.py
src/galileo/schema/trace.py
tests/schemas/test_logged.py
tests/test_logger_batch.py

Latest Contributors(2)

User	Commit	Date
calebe.sep@hotmail.com	fix-auto-convert-non-s...	March 02, 2026
mason@galileo.ai	feat-allow-session-met...	February 13, 2026

This pull request is reviewed by Baz. Review like a pro on (Baz).

baz-reviewer · 2026-03-13T03:52:21Z

src/galileo/traces.py

+    async def ingest_traces(self, traces_ingest_request: TracesIngestRequest) -> dict[str, Any]:
+        if self.experiment_id:
+            traces_ingest_request.experiment_id = UUID(self.experiment_id)
+        elif self.log_stream_id:
+            traces_ingest_request.log_stream_id = UUID(self.log_stream_id)


IngestTraces.ingest_traces repeats almost the same flow as ingest_spans: set the experiment/log stream IDs, flip on LoggingMethod.python_client, build the ingest URL, log the payload size, and post via an httpx.AsyncClient. The only real differences between the two methods are which route/attribute they use and the log message. Any future change to headers, timeout, metrics, or logging therefore needs to be made twice in blocks that are otherwise identical. Can we extract a shared helper (e.g. _post_to_ingest(endpoint: str, payload_attr: str, log_message: str, request: BaseModel)) that handles the ID injection, LoggingMethod assignment, URL construction, httpx request, and logging, and then let ingest_traces/ingest_spans simply call it with the different route and attribute? That keeps the helper consistent (shown below) and leaves the call sites tiny:

async def _post_to_ingest(...): if self.experiment_id: request.experiment_id = UUID(self.experiment_id) elif self.log_stream_id: request.log_stream_id = UUID(self.log_stream_id) request.logging_method = LoggingMethod.python_client url = f"{self._get_ingest_base_url()}{endpoint}" _logger.info(log_message, extra={"url": url, "project_id": self.project_id, "num_items": len(getattr(request, payload_attr))}) async with httpx.AsyncClient(...) as client: response = await client.post(url, json=request.model_dump(mode="json"), headers=self._get_auth_headers()) response.raise_for_status() return response.json()

async def ingest_traces(...): return await self._post_to_ingest(Routes.ingest_traces.format(...), "traces", "Sending traces to ingest service", traces_ingest_request) async def ingest_spans(...): return await self._post_to_ingest(Routes.ingest_spans.format(...), "spans", "Sending spans to ingest service", spans_ingest_request)

This avoids the same control flow being duplicated across the two methods.

_{Finding type: Code Dedup and Conventions | Severity: 🟢 Low}

Want Baz to fix this for you? Activate Fixer

baz-reviewer · 2026-03-13T03:52:21Z

src/galileo/traces.py

+        _logger.info(
+            "Sending traces to ingest service",
+            extra={"url": url, "project_id": self.project_id, "num_traces": len(traces_ingest_request.traces)},
+        )
+


ingest_traces logs the "Sending traces to ingest service" start event but never logs success or failure after httpx.AsyncClient.post/response.raise_for_status. This violates AGENTS.md's "Logging & sensitive-data handling" lifecycle requirement and leaves ingest writes unobservable. Can we add a completion log (success or failure with non-sensitive context) immediately after the POST/response.raise_for_status?

_{Finding type: AI Coding Guidelines | Severity: 🟢 Low}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In src/galileo/traces.py around lines 227 to 231, the ingest_traces method only logs the start of the request but never logs success or failure after the HTTP POST. Modify ingest_traces to wrap the httpx post/response.raise_for_status sequence in a try/except: after a successful response.raise_for_status() emit an _logger.info() completion log with non-sensitive context (url, project_id, num_traces, response.status_code); on exceptions (catch httpx.HTTPStatusError and a generic Exception) emit an _logger.error() with safe context (url, project_id, num_traces, status code or exception message trimmed) without logging request/response bodies or auth headers, then re-raise the exception to preserve existing error behavior.

baz-reviewer · 2026-03-13T03:52:21Z

src/galileo/traces.py

+        _logger.info(
+            "Sending spans to ingest service",
+            extra={"url": url, "project_id": self.project_id, "num_spans": len(spans_ingest_request.spans)},
+        )
+


ingest_spans only logs 'Sending spans to ingest service' before the HTTP POST and never emits a completion log. AGENTS.md requires start+completion lifecycle logs so operators lack success/failure context; can we add a completion log after response.raise_for_status()/on success that doesn't include sensitive data?

_{Finding type: AI Coding Guidelines | Severity: 🟢 Low}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In src/galileo/traces.py around lines 252 to 256, the ingest_spans method only logs the start of the HTTP request and never emits a completion/success log. Add a safe completion log immediately after response.raise_for_status() and before returning response.json(), e.g. call _logger.info("Finished sending spans to ingest service", extra={"url": url, "project_id": self.project_id, "num_spans": len(spans_ingest_request.spans), "status_code": response.status_code}). Do not log response body, headers, auth, or any sensitive data. Keep the rest of the method unchanged.

baz-reviewer · 2026-03-13T04:21:41Z

src/galileo/logger/logger.py

+            client = self._ingest_client or self._traces_client
+            await client.ingest_traces(traces_ingest_request)


Flush in batch mode can keep using the traces client instead of the dedicated ingest client when GALILEO_INGEST_URL is set after constructing GalileoLogger. Flush selects client via client = self._ingest_client or self._traces_client and immediately calls await client.ingest_traces(...) (lines 1929-1930), but _ingest_client is only created in __init__ so it stays None; can we call _create_ingest_client() before selecting the client (as async_ingest_traces does around 2124-2130) and apply the same lazy-creation fix to other similar call sites?

_{Finding type: Logical Bugs | Severity: 🔴 High}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In src/galileo/logger/logger.py around lines 1929-1930 (method _flush_batch), the code picks client = self._ingest_client or self._traces_client and then calls await client.ingest_traces(...). This fails if GALILEO_INGEST_URL was set after the GalileoLogger was constructed because _ingest_client was never created. Change the selection to lazily create the ingest client first (e.g. if self._ingest_client is None and os.environ.get('GALILEO_INGEST_URL'): self._ingest_client = self._create_ingest_client()), then set client = self._ingest_client or self._traces_client (and ensure _traces_client is created if needed). Also update the similar client-selection logic in _ingest_span_streaming (lines ~550-593) and _ingest_trace_streaming (lines ~516-542) to lazily create _ingest_client before using it so the dedicated ingest service is used if the env var is present.

baz-reviewer · 2026-03-13T04:21:42Z

tests/test_ingest_traces_client.py

+    @patch("galileo.traces.GalileoPythonConfig")
+    def test_accepts_log_stream_id(self, mock_config_class) -> None:
+        mock_config_class.get.return_value = Mock()
+
+        client = IngestTraces(project_id=PROJECT_ID, log_stream_id=LOG_STREAM_ID)


test_accepts_log_stream_id and other methods in TestIngestTracesInit omit the sentence-case # Given:/When:/Then: comments required by AGENTS.md, so these tests deviate from the documented testing convention and reduce clarity. Can we add non-empty sentence-case # Given:/When:/Then: comments to each test in tests/?

_{Finding type: AI Coding Guidelines | Severity: 🟢 Low}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In tests/test_ingest_traces_client.py around lines 35 to 39 (and similarly for the other test methods in this file such as those in TestIngestTracesInit and the request test classes), the test methods lack the required sentence-case `# Given:/When:/Then:` comments per AGENTS.md. Edit each test function to add three short, sentence-case comments: `# Given:` describing the test setup, `# When:` describing the action under test, and `# Then:` describing the expected outcome (do not leave any of them empty). Keep existing code and assertions unchanged aside from inserting these comments in the appropriate places immediately above the relevant code blocks.

baz-reviewer · 2026-03-13T04:21:42Z

tests/test_ingest_traces_client.py

+    @respx.mock
+    @pytest.mark.asyncio
+    async def test_ingest_traces_posts_to_correct_url(self, client, monkeypatch) -> None:
+        # Given: an ingest URL is configured
+        monkeypatch.setenv("GALILEO_INGEST_URL", INGEST_URL)
+


TestIngestTracesRequest uses @respx.mock/httpx instead of the shared mock_request fixture required by AGENTS.md (lines 207–235) and tests/conftest.py. This bypasses the shared setup (including --disable-socket) and breaks the documented mocking convention; can we refactor these tests to use mock_request and the accompanying mocks instead of wiring respx manually?

_{Finding type: AI Coding Guidelines | Severity: 🟢 Low}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In tests/test_ingest_traces_client.py around lines 148 to 153, the tests in TestIngestTracesRequest use @respx.mock and respx.post to stub HTTP calls which violates AGENTS.md and bypasses the shared mock_request fixture. Refactor these tests to remove @respx.mock and respx usage: have each async test accept the mock_request fixture (add mock_request to the parameter list), use that fixture to register the expected ingest URL and canned httpx response (matching the JSON bodies used today), and assert route calls via the mock_request tracking API instead of respx. Apply the same replacement to the other tests in TestIngestTracesRequest and TestIngestSpansRequest that use respx so all HTTP interactions reuse mock_request and respect --disable-socket.

Adds a new IngestTraces client that talks directly to the Go ingest service via httpx, activated when GALILEO_INGEST_URL is set. Falls back to the existing Traces client (main API) otherwise. [sc-58541]

Do not serialize trace input/output to string; preserve multimodal content. Add test_multimodal_input_not_stringified_at_trace_level.

baz-reviewer · 2026-03-17T04:21:49Z

src/galileo/logger/logger.py

        trace = LoggedTrace(
-            input=serialize_to_str(input),
-            redacted_input=serialize_to_str(redacted_input) if redacted_input else None,
-            output=serialize_to_str(output),
-            redacted_output=serialize_to_str(redacted_output) if redacted_output else None,
+            input=input,
+            redacted_input=redacted_input,
+            output=output,
+            redacted_output=redacted_output,


add_single_llm_span_trace now constructs LoggedTrace by passing input/output/redacted_input/redacted_output directly (new lines 1034–1038), but LoggedTrace.input/output only accept str or Sequence[LoggedMessage] and the previous implementation serialized dicts/core Message via serialize_to_str. Callers that pass dicts or Message objects will fail validation before ingestion. Can we restore trace-level serialization/normalization before constructing LoggedTrace, or else narrow the API to only accept str/Sequence[LoggedMessage], and review related helpers such as add_llm_span for the same regression?

_{Finding type: Breaking Changes | Severity: 🔴 High}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In src/galileo/logger/logger.py around lines 1034 to 1038, the add_single_llm_span_trace method now constructs LoggedTrace by passing input/output/redacted_input/redacted_output directly, which breaks validation because LoggedTrace expects str or Sequence[LoggedMessage]. Restore the previous trace-level serialization by calling serialize_to_str(input) and serialize_to_str(output) (and serialize_to_str(redacted_input)/serialize_to_str(redacted_output) when present) before creating LoggedTrace, while keeping the full structured values for the child LoggedLlmSpan as currently implemented. Also search nearby helper methods that used serialize_to_str (e.g., add_llm_span/add_span) and ensure consistent normalization behavior so callers passing dicts or Message objects continue to work.

baz-reviewer · 2026-03-18T13:51:57Z

galileo-adk/src/galileo_adk/trace_builder.py

+        if self.current_parent() is not None:
+            raise ValueError("You must conclude the existing trace before adding a new one.")
+        trace = LoggedTrace(
+            input=input,
+            redacted_input=redacted_input,


The guard/LoggedTrace constructor here duplicates all of GalileoLogger.add_trace (src/galileo/logger/logger.py lines 425‑463) – the same current_parent() check, metrics/dataset defaults, and parent tracking exist in both classes. Any future change to the LoggedTrace contract (metadata conversion, dataset fields, parent management) will now require edits in two places. Could we share a helper (e.g. in TracesLogger or a small builder) so the hook-mode builder and the main logger reuse the same trace construction logic?

_{Finding type: Code Dedup and Conventions | Severity: 🟢 Low}

Want Baz to fix this for you? Activate Fixer

baz-reviewer · 2026-03-18T13:51:57Z

galileo-adk/src/galileo_adk/trace_builder.py

+        span = LoggedWorkflowSpan(
            input=input,
            redacted_input=redacted_input,
            output=output,
            redacted_output=redacted_output,


The LoggedWorkflowSpan construction here mirrors GalileoLogger.add_workflow_span (see src/galileo/logger/logger.py lines 1473‑1550) almost verbatim – same metadata conversion, metrics, UUID creation, parent wiring, and status handling. Because the hook-mode builder and the main logger both need to keep this span shape and parent manipulation in sync, can we move the shared span creation/parent tracking into a helper (or re‑use GalileoLogger._attach_parentable_span) so we don’t duplicate the same 10+ lines in two modules?

_{Finding type: Code Dedup and Conventions | Severity: 🟢 Low}

Want Baz to fix this for you? Activate Fixer

baz-reviewer · 2026-03-18T13:51:57Z

src/galileo/traces.py

+            response = await client.post(url, json=json_body, headers=self._get_auth_headers())
+            response.raise_for_status()
+            return response.json()
+


response.json() can raise JSONDecodeError for empty/non-JSON 200/204 responses; should we catch JSONDecodeError and translate it into a controlled SDK error or default payload?

_{Finding type: Logical Bugs | Severity: 🔴 High}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In src/galileo/traces.py around lines 233 to 236, the ingest_traces method calls response.json() directly after response.raise_for_status(), which will raise json.JSONDecodeError if the ingest service returns an empty or non-JSON body. Wrap the call to response.json() in a try/except catching json.JSONDecodeError (import json or JSONDecodeError), log a warning including the response.status_code and response.text, and return a safe default payload (e.g. an empty dict) or raise a controlled SDK-specific error instead of letting the JSONDecodeError escape. Apply the same defensive change to the ingest_spans method at the analogous block (around lines 256 to 258) so both endpoints behave consistently.

baz-reviewer · 2026-03-18T13:51:58Z

src/galileo/schema/logged.py

+IngestInputType = Union[str, Sequence[LoggedMessage]]
+IngestOutputType = Union[str, LoggedMessage, Sequence[Document]]
+
+_INPUT_FIELD = Field(default="", description=BaseStep.model_fields["input"].description, union_mode="left_to_right")
+_REDACTED_INPUT_FIELD = Field(
+    default=None, description=BaseStep.model_fields["redacted_input"].description, union_mode="left_to_right"
+)
+_OUTPUT_FIELD = Field(default=None, description=BaseStep.model_fields["output"].description, union_mode="left_to_right")
+_REDACTED_OUTPUT_FIELD = Field(


start_trace advertises Message input but lacks conversion to LoggedMessage — should we add a trace-level conversion helper?

_{Finding type: Logical Bugs | Severity: 🔴 High}

Want Baz to fix this for you? Activate Fixer

Other fix methods

Prompt for AI Agents:

In src/galileo/schema/logged.py around lines 30-38, the IngestInputType and LoggedTrace.input currently only accept str or Sequence[LoggedMessage], but callers may pass core Message objects. Add conversion helpers and a model-level coercion so LoggedTrace will accept core Message/dict inputs and convert them into LoggedMessage instances before validation. Concretely: implement a classmethod on LoggedTrace similar to LoggedLlmSpan._to_logged_message and _convert_dict_to_message, then add a Pydantic root validator or model_post_init that walks input and redacted_input and replaces any Message or dict entries with LoggedMessage via those helpers (also consider doing the same for output/redacted_output if they can be Message types). This preserves the documented start_trace behavior and prevents Pydantic validation errors.

pratduv requested a review from a team as a code owner March 13, 2026 03:48

pratduv requested review from anurag-lang and removed request for a team and anurag-lang March 13, 2026 03:48

baz-reviewer bot reviewed Mar 13, 2026

View reviewed changes

pratduv force-pushed the sc-58230/local-ingest-models branch from 1a29b4f to 2405b0e Compare March 13, 2026 04:12

pratduv force-pushed the sc-58230/ingest-client branch from cce7d18 to 4d196b7 Compare March 13, 2026 04:16

baz-reviewer bot reviewed Mar 13, 2026

View reviewed changes

pratduv force-pushed the sc-58230/local-ingest-models branch 5 times, most recently from 1875ca6 to c9eb90e Compare March 13, 2026 23:45

feat: Creating custom input and output types for multimodal ingestion

f2c95c6

pratduv force-pushed the sc-58230/local-ingest-models branch from c9eb90e to f2c95c6 Compare March 14, 2026 00:11

pratduv added 2 commits March 16, 2026 13:41

feat: add IngestTraces client for dedicated ingest service

8d1c596

Adds a new IngestTraces client that talks directly to the Go ingest service via httpx, activated when GALILEO_INGEST_URL is set. Falls back to the existing Traces client (main API) otherwise. [sc-58541]

fix: pass trace-level I/O through without stringifying; add test

2123eae

Do not serialize trace input/output to string; preserve multimodal content. Add test_multimodal_input_not_stringified_at_trace_level.

pratduv force-pushed the sc-58230/ingest-client branch from 4d196b7 to 2123eae Compare March 17, 2026 04:13

baz-reviewer bot reviewed Mar 17, 2026

View reviewed changes

Base automatically changed from sc-58230/local-ingest-models to main March 18, 2026 13:43

baz-reviewer bot reviewed Mar 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add IngestTraces client for dedicated ingest service#501

feat: add IngestTraces client for dedicated ingest service#501
pratduv wants to merge 3 commits intomainfrom
sc-58230/ingest-client

pratduv commented Mar 13, 2026 •

edited by baz-reviewer bot

Loading

Uh oh!

baz-reviewer bot Mar 13, 2026

Uh oh!

baz-reviewer bot Mar 13, 2026

Uh oh!

baz-reviewer bot Mar 13, 2026

Uh oh!

baz-reviewer bot Mar 13, 2026

Uh oh!

baz-reviewer bot Mar 13, 2026

Uh oh!

baz-reviewer bot Mar 13, 2026

Uh oh!

baz-reviewer bot Mar 17, 2026

Uh oh!

baz-reviewer bot Mar 18, 2026

Uh oh!

baz-reviewer bot Mar 18, 2026

Uh oh!

baz-reviewer bot Mar 18, 2026

Uh oh!

baz-reviewer bot Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		client = self._ingest_client or self._traces_client
		await client.ingest_traces(traces_ingest_request)

Conversation

pratduv commented Mar 13, 2026 • edited by baz-reviewer bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

User description

Summary

Changes

New: IngestTraces client (src/galileo/traces.py)

Modified: GalileoLogger (src/galileo/logger/logger.py)

New routes (src/galileo/constants/routes.py)

Tests (tests/test_ingest_traces_client.py)

Why

Test plan

Generated description

Uh oh!

baz-reviewer bot Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

baz-reviewer bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pratduv commented Mar 13, 2026 •

edited by baz-reviewer bot

Loading

New: `IngestTraces` client (`src/galileo/traces.py`)

Modified: `GalileoLogger` (`src/galileo/logger/logger.py`)

New routes (`src/galileo/constants/routes.py`)

Tests (`tests/test_ingest_traces_client.py`)